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e Combines multiple Gaussian 
(normal) distributions 
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e Useful when there is no prior 
knowledge of the number of 
clusters or when the clusters have 
different shapes and sizes 
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Outlier e Models the data distribution and 
y then identifies the data points 
Detection that are far from the distribution 


| 
e Models the normal data 
distribution and then identifies 
data points that are unlikely to 
have come from that distribution 
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e Used in computer vision to 


Real-world segment images into different 
applications regions based on color or texture 
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Flexible 


sE Represents complex data distributions 
using a combination of simpler Gaussian 
(normal) distributions 


Represents a dataset as a weighted sum of 
Gaussian distributions, where each 
distribution has its own mean and variance 


= Combination of multiple Gaussian 
distributions allows the GMM to capture a 
variety of patterns and structures in the 
data 


Clustering 


The parameters of the Gaussian 
distributions (i.e., the mean and variance) 
are estimated using the Expectation- 
Maximization (EM) algorithm 


Outlier Detection 


Models the underlying distribution of the 
data and identifies data points that are 
unlikely to have been generated by that 

distribution 
Once the GMM is fit to the data, the 
likelihood of each data point being 
generated by the GMM is calculated 


Data points that have a low likelihood of 
being generated by the GMM are 
considered outliers 


Anomaly Detection 


Em Can be used for anomaly detection by 
identifying data points that are significantly 
different from the underlying distribution 
of the data 


In GMM-based outlier detection, the focus 
is on identifying points that are far away 
from the cluster centers of the model, or 

that have low probabilities of belonging to 

any of the clusters. 
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Real-world 
applications 


In segmentation, GMMs are used to model 
the distribution of pixel intensities in an 
image and separate the foreground from 

the background 


In speech recognition systems to model the 
distribution of speech features, such as 
mel-frequency cepstral coefficients 
(MFCCs), and classify them into phonemes 
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